- Newsgroups: comp.lang.c
- Path: eso.org!news
- From: Nicolas Devillard <nDevil@eso.org>
- Subject: Re: C-function for string similarity
- X-Nntp-Posting-Host: mc14
- Content-Type: text/plain; charset=us-ascii
- Message-ID: <313C4368.2FDE@eso.org>
- Sender: news@eso.org
- Content-Transfer-Encoding: 7bit
- Organization: ESO - European Southern Observatory, Garching by Munich
- References: <4hgsg2$95s@kralle.zdv.Uni-Mainz.DE>
- Mime-Version: 1.0
- Date: Tue, 05 Mar 1996 14:36:40 +0100
- X-Mailer: Mozilla 2.0 (X11; I; SunOS 5.4 sun4m)
-
- Robert Mueller wrote:
- >
- > Hallo,
- >
- > I am looking for a clever C-function (or LISP, PROLOG etc) taking two
- > strings as input and "evaluating" the measure of similarity between the
- > strings. For example, if the
- > strings are identical, the function should return 1, if they share no
- > characters at all, it should return 0. If they share some characters, and
- > these characters have a "comparable" position and order in this string,
- > the function should return something between 0 and 1, indicating the
- > degree of similarity.
- >
- > Any pointer to libraries or code with such a clever function are welcome.
- > Thank you very much in advance.
- >
- > Robert
-
-
- The problem is: what do you call similarity?
- There is a virtually infinite number of ways to measure a distance
- between two objects, binary or not. Books on pattern matching cover
- the subject. The field has its roots in mathematical topology (the
- notion of a metric), so you may want to look at that first.
-
- Once you have picked a measure, there may well be existing C code
- that implements it :)
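-
- To make the point concrete, here is a minimal sketch of one possible
- measure among many: Levenshtein (edit) distance, scaled into [0, 1].
- The function name str_similarity and the normalization
- 1 - distance / max(length) are my own assumptions for illustration,
- not a standard library routine, and other definitions of similarity
- would give different numbers.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (an illustration, not a library function).
 * Returns 1 - edit_distance(a, b) / max(strlen(a), strlen(b)):
 *   1.0 for identical strings, 0.0 when no characters can be matched,
 *   -1.0 if memory allocation fails.
 */
double str_similarity(const char *a, const char *b)
{
    size_t la, lb, maxlen, i, j, dist;
    size_t *row;

    assert(a != NULL && b != NULL);

    la = strlen(a);
    lb = strlen(b);
    maxlen = la > lb ? la : lb;
    if (maxlen == 0)
        return 1.0;                 /* two empty strings are identical */

    /* One row of the dynamic-programming table is enough. */
    row = malloc((lb + 1) * sizeof *row);
    if (row == NULL)
        return -1.0;

    for (j = 0; j <= lb; j++)
        row[j] = j;                 /* distance from "" to b[0..j) */

    for (i = 1; i <= la; i++) {
        size_t diag = row[0];       /* value of the cell up and to the left */
        row[0] = i;                 /* distance from a[0..i) to "" */
        for (j = 1; j <= lb; j++) {
            size_t up = row[j];     /* value of the cell directly above */
            size_t best = diag + (a[i - 1] == b[j - 1] ? 0 : 1);

            if (up + 1 < best)
                best = up + 1;          /* delete a[i-1] */
            if (row[j - 1] + 1 < best)
                best = row[j - 1] + 1;  /* insert b[j-1] */

            row[j] = best;
            diag = up;
        }
    }

    dist = row[lb];
    free(row);
    return 1.0 - (double)dist / (double)maxlen;
}
```

- With this definition, "kitten" against "sitting" has an edit distance
- of 3 over a maximum length of 7, so the score is 1 - 3/7. Whether
- that matches your idea of "comparable position and order" is exactly
- the question above.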
-
- Hope it helps,
- Nicolas
-